Corpus: aze_wikipedia_2018_1M

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 33865 s-
2 32968 m-
3 28253 t-
4 27810 M-
5 27282 a-
Top Character Bigrams
word rank frequency n-gram
1 9772 ya-
2 9656 qa-
3 9485 tə-
4 8115 mə-
5 7364 Ma-
Top Character Trigrams
word rank frequency n-gram
1 2926 yar-
2 2143 qar-
3 1911 ist-
4 1891 pro-
5 1836 kon-
Top Character 4-Grams
word rank frequency n-gram
1 1314 yara-
2 1144 yarı-
3 1033 Qara-
4 941 isti-
5 802 keçi-
Top Character 5-Grams
word rank frequency n-gram
1 748 yarım-
2 691 qeyri-
3 675 dəyiş-
4 631 başla-
5 591 göstə-
11584 msec needed at 2024-01-22 02:22